THE PRIME NUMBER THEOREM ON THE NOSE 
Abstract. The prime number theorem gives the following asymptotic for the
$n$-th prime: $p_n \sim \mathrm{iL}(n)$, where we are calling iL the inverse to the
logarithmic integral function, Li. Let $\pi(x)$ denote the number of primes $p < x$
with $p = \lfloor \mathrm{iL}(n) \rfloor$ for some $n$. We say that these primes hit the
value suggested by the prime number theorem "on the nose". Using exponential sums, the
method of stationary phase, and Vaughan-type identities, we show that
$\pi(x) \sim \frac{x}{\log^2 x}$ and interpret this fact as the independence of the
process of the primes to their average value, iL.



1. Introduction 

It is a folklore conjecture that any integer-valued function should contain
infinitely many primes, so long as no devastating obstacle presents itself (for instance,
it would be quite extraordinary if one could find many primes in the sequence
$\{n^2 - 1\}$). Dirichlet answered this question completely in the case of lines, and
none since has succeeded for a single higher-degree polynomial. Iwaniec showed
that there are infinitely many almost primes (numbers with at most two prime factors)
in the sequence $\{n^2 + 1\}$ and similarly in any admissible one-variable quadratic
polynomial. Allowing two variables, there are the famous results of Friedlander
and Iwaniec [1] that there are infinitely many primes in $X^2 + Y^4$, and the
corresponding theorem of Heath-Brown for the even sparser polynomial $X^3 + 2Y^3$.

In the 1950's, Piatetski-Shapiro gave an interpolation between Dirichlet's
primes in arithmetic progressions and the higher degree case via the sequence
$\{\lfloor n^c \rfloor\}$. Here $\lfloor x \rfloor$ is the integer part of $x$ and $c$
is any constant in some interval $[1, c_0)$. Clearly there are no primes when $c = 2$
but it is believed that $c_0 = 2$ is the correct upper bound. Piatetski-Shapiro
succeeded with $c_0 = \frac{12}{11} \approx 1.0909\ldots$, and then improvements of
Kolesnik [7], Heath-Brown and Rivat (among many others) brought it to the current
record, $c_0 = \frac{2817}{2426} \approx 1.1612\ldots$ due to Rivat and Sargos [10].

The quantitative result is of course an asymptotic formula. Let $\pi_c(x)$ represent
the number of primes $p < x$ with $p = \lfloor n^c \rfloor$ for some $n$. Then these
theorems state that $\pi_c(x) \sim \frac{x^{1/c}}{\log x}$. While the common
interpretation of Piatetski-Shapiro's theorem is an instance of primes in a higher
order polynomial, we prefer to say that it demonstrates the independence of the
exponential function $n^c$ and the random process of the primes. More precisely, the
fraction of $c$-th powers to all numbers up to $x$ is roughly $\frac{x^{1/c}}{x}$ while
the corresponding probability for primes is $\frac{1}{\log x}$. The two events (hitting
a prime or a $c$-th power) are independent if the probability of the intersection is
the product of the fractions, namely $\frac{x^{1/c}}{x \log x}$. But this is nothing
more than the corresponding asymptotic for the count,
$\pi_c(x) \sim \frac{x^{1/c}}{\log x}$.

In the present work we are concerned not with improving the value of $c_0$ but with
the opposite direction: decreasing the exponent down to a log. So we would like to
find primes in the sequence $\{n \log n\}$. Actually, the Prime Number Theorem gives
the asymptotic formula for the $n$-th prime, $p_n \sim \mathrm{iL}(n)$, where we are
(somewhat facetiously) calling iL the inverse to the logarithmic integral function.

$$\mathrm{Li}(x) = \int_2^x \frac{dt}{\log t}$$

The following expansion can be found for example in [12]:

$$\mathrm{iL}(x) = x \log x + x \log\log x - x + x\,\frac{\log\log x - 2}{\log x} + \cdots.$$
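
As a numerical aside, Li and iL are easy to evaluate. The following is a minimal sketch and not part of the argument. It assumes the classical series $\mathrm{li}(y) = \gamma + \log\log y + \sum_{k \ge 1} (\log y)^k / (k \cdot k!)$, in which the constant $\gamma$ cancels when forming $\mathrm{Li}(x) = \mathrm{li}(x) - \mathrm{li}(2)$, and it inverts Li by bisection. The function names `Li`, `iL` and `iL_expansion` are illustrative choices, not notation from the text beyond the two functions they compute.

```python
import math

def Li(x, terms=200):
    """Li(x) = integral from 2 to x of dt/log t, for x >= 2.

    Uses li(y) = gamma + log(log y) + sum_{k>=1} (log y)^k / (k * k!);
    the constant gamma cancels in the difference li(x) - li(2).
    """
    def li_without_gamma(y):
        t = math.log(y)
        total = math.log(t)
        term = 1.0                      # holds t^k / k!
        for k in range(1, terms + 1):
            term *= t / k
            total += term / k
        return total
    return li_without_gamma(x) - li_without_gamma(2.0)

def iL(n, iterations=100):
    """Inverse of Li by bisection: the x >= 2 with Li(x) = n, for n >= 0."""
    lo, hi = 2.0, 4.0
    while Li(hi) < n:                   # Li is increasing, so this brackets the root
        hi *= 2.0
    for _ in range(iterations):
        mid = 0.5 * (lo + hi)
        if Li(mid) < n:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)

def iL_expansion(x):
    """First four terms of the asymptotic expansion of iL quoted above."""
    lx = math.log(x)
    llx = math.log(lx)
    return x * lx + x * llx - x + x * (llx - 2.0) / lx

if __name__ == "__main__":
    # Compare the exact inverse with the truncated expansion.
    for n in (10, 100, 1000, 10000):
        print(n, round(iL(n), 3), round(iL_expansion(n), 3))
```

The truncated expansion is only asymptotic, so the two columns should be expected to agree in relative terms only for large arguments.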

Let $\pi(x)$ denote the number of primes $p < x$ which hit the prime number theorem
"on the nose", $p = \lfloor \mathrm{iL}(n) \rfloor$ for some integer $n$. Our main result is

Theorem 1.1. $\pi(x) = x \cdot \frac{1}{\log x} \cdot \frac{1}{\log x} + O\left(\frac{x}{\log^3 x}\right) = \frac{x}{\log^2 x} + O\left(\frac{x}{\log^3 x}\right)$.

Our interpretation is again that the sequence $\lfloor \mathrm{iL}(n) \rfloor$ is
independent of the process of the primes. This is our reason for choosing to write the
first equality in Theorem 1.1. There is the same proportion of elements of
$\lfloor \mathrm{iL}(n) \rfloor$ and primes less than $x$, namely $\frac{1}{\log x}$.
Multiplying these probabilities gives the fraction $\frac{1}{\log^2 x}$ and
the count for $\pi(x)$ as above. In some sense, this is completely expected, but in
another sense, it is a bit surprising: the process of the primes is independent of its
average value.

The proof follows analogously to Piatetski-Shapiro's and uses the techniques of 
estimating exponential sums. In the following section, we reduce the problem to an 
exponential sum over the primes. We devote the third section to breaking the sum 
into ones of Type I and Type II (as per Vaughan's nomenclature) and estimating 
these individually, completing the proof of Theorem 1.1. We reserve the (tedious)
calculations of the method of stationary phase and the necessary bound on Type 
II sums for the first two appendices. In the third appendix, we trivially extend our 
results to primes on the nose in arithmetic progressions. 

2. Reduction to exponential sums over primes 

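We follow standard methods, which we include here for completeness. If
$p = \lfloor \mathrm{iL}(n) \rfloor$ then $p \le \mathrm{iL}(n) < p + 1$, or equivalently,
$\mathrm{Li}(p) \le n < \mathrm{Li}(p + 1)$. The existence of an integer in the interval
$[\mathrm{Li}(p), \mathrm{Li}(p + 1))$ is indicated by the value
$\lfloor \mathrm{Li}(p + 1) \rfloor - \lfloor \mathrm{Li}(p) \rfloor$, so we have

$$\pi(x) = \sum_{p \le x} \left( \lfloor \mathrm{Li}(p+1) \rfloor - \lfloor \mathrm{Li}(p) \rfloor \right).$$

Write $\lfloor \theta \rfloor = \theta - \psi(\theta) - \frac12$, where the shifted
fractional part $\psi(\theta) = \{\theta\} - \frac12 \in [-\frac12, \frac12)$
has the standard truncated Fourier expansion (see e.g. Theorem A.6 of [2]):

$$\psi(\theta) = \sum_{0 < |h| \le H} c_h e(\theta h) + O\left(\frac{1}{H}\right) \qquad (2.1)$$

with $c_h \ll \frac{1}{|h|}$, and $e(x) = e^{2\pi i x}$. In the above, $H$ is a parameter
which we will choose later (we will eventually set $H$ to be a suitable power of $\log N$).
So we have:

$$\pi(x) = \sum_{p \le x} \left( \mathrm{Li}(p + 1) - \mathrm{Li}(p) \right) + \sum_{p \le x} \left( \psi(\mathrm{Li}(p)) - \psi(\mathrm{Li}(p + 1)) \right).$$

Since $\mathrm{Li}'(x) = \frac{1}{\log x}$, we use the Taylor expansion
$\mathrm{Li}(p + 1) = \mathrm{Li}(p) + \frac{1}{\log p} + O\left(\frac{1}{p \log^2 p}\right)$
to get:

$$\pi(x) = \sum_{p \le x} \frac{1}{\log p} + \sum_{p \le x} \left( \psi(\mathrm{Li}(p)) - \psi(\mathrm{Li}(p + 1)) \right) + O(1).$$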
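
The decomposition of the count into a smooth sum and an oscillating sum can be checked numerically. The sketch below is illustrative only: it assumes the series for li used to evaluate Li, and the names `primes_up_to`, `psi` and `decompose` are hypothetical helpers. It returns the count from the floor differences, the smooth sum of $\mathrm{Li}(p+1) - \mathrm{Li}(p)$, the oscillating sum of $\psi(\mathrm{Li}(p)) - \psi(\mathrm{Li}(p+1))$, and the sum of $1/\log p$.

```python
import math

def Li(x, terms=200):
    """Li(x) = integral from 2 to x of dt/log t, via the series for li (gamma cancels)."""
    def li_without_gamma(y):
        t = math.log(y)
        total = math.log(t)
        term = 1.0                      # holds t^k / k!
        for k in range(1, terms + 1):
            term *= t / k
            total += term / k
        return total
    return li_without_gamma(x) - li_without_gamma(2.0)

def primes_up_to(limit):
    """All primes <= limit (limit >= 1), by the sieve of Eratosthenes."""
    sieve = bytearray([1]) * (limit + 1)
    sieve[0:2] = b"\x00\x00"
    for i in range(2, math.isqrt(limit) + 1):
        if sieve[i]:
            sieve[i * i :: i] = bytearray(len(range(i * i, limit + 1, i)))
    return [i for i in range(limit + 1) if sieve[i]]

def psi(theta):
    """Shifted fractional part {theta} - 1/2."""
    return theta - math.floor(theta) - 0.5

def decompose(x):
    """Return (count, smooth sum, oscillating sum, sum of 1/log p) over primes p <= x."""
    count, smooth, oscillating, reciprocal_logs = 0, 0.0, 0.0, 0.0
    for p in primes_up_to(x):
        a, b = Li(p), Li(p + 1)
        count += math.floor(b) - math.floor(a)
        smooth += b - a
        oscillating += psi(a) - psi(b)
        reciprocal_logs += 1.0 / math.log(p)
    return count, smooth, oscillating, reciprocal_logs

if __name__ == "__main__":
    print(decompose(1500))
```

The first entry should equal the sum of the second and third up to rounding, and the second and fourth should differ by a bounded amount, in line with the $O(1)$ term above.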



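By partial summation and the prime number theorem (with $\pi(t)$ inside the integrals
denoting the usual prime counting function),

$$\sum_{p \le x} \frac{1}{\log p} = \int_2^x \frac{d\pi(t)}{\log t} = \frac{\pi(x)}{\log x} + O\left( \int_2^x \frac{\pi(t)}{t \log^2 t}\,dt \right) = \frac{x}{\log^2 x} + O\left( \frac{x}{\log^3 x} \right).$$

Therefore it suffices to show that

$$\sum_{p \le x} \left( \psi(\mathrm{Li}(p)) - \psi(\mathrm{Li}(p + 1)) \right) \ll \frac{x}{\log^3 x}.$$

Equivalently, we require that:

$$\sum_{N < n \le N_1 \le 2N} \Lambda(n) \left( \psi(\mathrm{Li}(n)) - \psi(\mathrm{Li}(n + 1)) \right) \ll \frac{N}{\log^2 N}.$$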

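Using (2.1) we write the sum above as $\Sigma = \Sigma_1 + O(\Sigma_2)$ where

$$\Sigma_1 = \sum_n \Lambda(n) \sum_{0 < |h| \le H} c_h \left( e(h\mathrm{Li}(n)) - e(h\mathrm{Li}(n + 1)) \right)$$

and $\Sigma_2$ is the corresponding sum over $n$, weighted by $\Lambda(n)$, of the error
terms from (2.1).

It is clear by the Prime Number Theorem that $\Sigma_2 \ll N/H$, so choosing $H$ to be
a suitable power of $\log N$ dispenses with the error.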

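On writing $\phi_h(x) = 1 - e(h(\mathrm{Li}(x + 1) - \mathrm{Li}(x)))$ and by partial
summation, we see that

$$\begin{aligned}
\Sigma_1 &\ll \sum_{1 \le h \le H} \frac{1}{h} \left| \sum_{N < n \le N_1} \Lambda(n) \phi_h(n) e(h\mathrm{Li}(n)) \right| \\
&\ll \sum_{1 \le h \le H} \frac{1}{h} \left| \phi_h(N_1) \sum_{N < n \le N_1} \Lambda(n) e(h\mathrm{Li}(n)) \right|
 + \int_N^{N_1} \sum_{1 \le h \le H} \frac{1}{h} \left| \frac{d\phi_h(x)}{dx} \right| \left| \sum_{N < n \le x} \Lambda(n) e(h\mathrm{Li}(n)) \right| dx \\
&\ll \frac{1}{\log N} \max_{N_2 \le 2N} \sum_{1 \le h \le H} \left| \sum_{N < n \le N_2} \Lambda(n) e(h\mathrm{Li}(n)) \right|.
\end{aligned}$$

Here we used the bounds

$$\phi_h(x) \ll h(\mathrm{Li}(x + 1) - \mathrm{Li}(x)) \ll \frac{h}{\log N}$$

and

$$\frac{d\phi_h(x)}{dx} \ll h \left( \frac{1}{\log(x + 1)} - \frac{1}{\log x} \right) \ll \frac{h}{N \log^2 N}$$

for $N \le x \le 2N$. We have thus reduced Theorem 1.1 to the estimate:

$$\sum_{0 < h \le H} \left| \sum_{N < n \le N_2 \le 2N} \Lambda(n) e(h\mathrm{Li}(n)) \right| \ll \frac{N}{\log N},$$

which we prove in the next section.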



3. Vaughan estimates 

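Our goal in this section is to demonstrate

Theorem 3.1. The left hand side of the estimate at the end of Section 2 is $\ll N^{49/50+\epsilon}$.

We now follow the outline of [2, Chapter 4.6]. Fix $u$ and $v$, parameters to be
chosen later, and let $F(s) = \sum_{1 \le n \le v} \Lambda(n) n^{-s}$ and
$M(s) = \sum_{1 \le n \le u} \mu(n) n^{-s}$. Notice, for instance, that
$\frac{\zeta'}{\zeta}(s) + F(s) = -\sum_{n > v} \Lambda(n) n^{-s}$. Comparing the
Dirichlet coefficients on both sides of the identity

$$\frac{\zeta'}{\zeta} + F = \left( \frac{\zeta'}{\zeta} + F \right)(1 - \zeta M) + \zeta' M + \zeta F M$$

gives for $n > v$:

$$-\Lambda(n) = -\sum_{\substack{kl = n \\ k > v,\; l > u}} \Lambda(k) \sum_{\substack{d \mid l \\ d > u}} \mu(d) \;-\; \sum_{\substack{kl = n \\ l \le u}} \log k \, \mu(l) \;+\; \sum_{\substack{klm = n \\ l \le v,\; m \le u}} \Lambda(l)\mu(m).$$

Assume for now that $v < N$ (we will eventually set $u$ and $v$ to be slightly less
than $\sqrt{N}$). Multiplying the above identity by $e(h\mathrm{Li}(n))$ and summing over $n$ gives: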
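
Before it is used, the coefficient identity can be sanity-checked by brute force for small $n$. The sketch below is an illustration under the reading of the identity given above (for $n > v$, $\Lambda(n)$ equals the Type II term plus the $\mu(l)\log k$ term minus the $\Lambda(l)\mu(m)$ term). The helper names `mobius`, `mangoldt`, `divisors` and `vaughan_rhs` are hypothetical.

```python
import math

def mobius(n):
    """Moebius function mu(n) by trial division."""
    result = 1
    p = 2
    while p * p <= n:
        if n % p == 0:
            n //= p
            if n % p == 0:
                return 0                # square factor
            result = -result
        p += 1
    if n > 1:
        result = -result                # one remaining prime factor
    return result

def mangoldt(n):
    """von Mangoldt function: log p if n is a power of the prime p, else 0."""
    if n < 2:
        return 0.0
    p = 2
    while p * p <= n:
        if n % p == 0:
            while n % p == 0:
                n //= p
            return math.log(p) if n == 1 else 0.0
        p += 1
    return math.log(n)                  # n itself is prime

def divisors(n):
    return [d for d in range(1, n + 1) if n % d == 0]

def vaughan_rhs(n, u, v):
    """Right-hand side of the identity for Lambda(n), valid for n > v."""
    # Sum over kl = n with k > v, l > u of Lambda(k) * a(l),
    # where a(l) is the sum of mu(d) over divisors d of l with d > u.
    type_two = 0.0
    for k in divisors(n):
        l = n // k
        if k > v and l > u:
            a_l = sum(mobius(d) for d in divisors(l) if d > u)
            type_two += mangoldt(k) * a_l
    # Sum over kl = n with l <= u of mu(l) * log k.
    type_one = sum(mobius(l) * math.log(n // l) for l in divisors(n) if l <= u)
    # Sum over klm = n with l <= v, m <= u of Lambda(l) * mu(m).
    correction = 0.0
    for l in divisors(n):
        if l <= v:
            for m in divisors(n // l):
                if m <= u:
                    correction += mangoldt(l) * mobius(m)
    return type_two + type_one - correction

if __name__ == "__main__":
    u, v = 3, 5
    worst = max(abs(vaughan_rhs(n, u, v) - mangoldt(n)) for n in range(v + 1, 301))
    print("largest discrepancy:", worst)
```

The check is purely combinatorial and says nothing about the size of the resulting exponential sums; it only confirms the bookkeeping of the three pieces.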

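$$\begin{aligned}
\sum_{N < n \le N_2 \le 2N} \Lambda(n) e(h\mathrm{Li}(n))
&= \sum_{u < l \le N_2/v} \; \sum_{\substack{N/l < k \le N_2/l \\ v < k}} \Lambda(k) a(l) e(h\mathrm{Li}(kl)) \\
&\quad + \sum_{l \le u} \; \sum_{N/l < k \le N_2/l} \mu(l) \log k \; e(h\mathrm{Li}(kl)) \\
&\quad - \sum_{r \le uv} \; \sum_{N/r < k \le N_2/r} b(r) e(h\mathrm{Li}(kr)) \\
&= S_1 + S_2 - S_3,
\end{aligned}$$

where

$$a(l) = \sum_{\substack{d \mid l \\ d > u}} \mu(d), \quad \text{and} \quad b(r) = \sum_{\substack{lm = r \\ l \le v,\; m \le u}} \Lambda(l)\mu(m).$$

Notice that $|a(l)| \le d(l)$ and similarly $|b(r)| \le \sum_{d \mid r} \Lambda(d) = \log r$;
so we have the estimates

$$\sum_{L < l \le 2L} |a(l)|^2 \ll L \log^3 L, \quad \text{and} \quad \sum_{R < r \le 2R} |b(r)|^2 \ll R \log^3 R.$$

It now suffices to show that $\sum_{0 < h \le H} |S_i| \ll N^{49/50+\epsilon}$ for each
$i = 1, 2, 3$ by choosing $u$ and $v$ appropriately. We treat the sums $S_i$
individually in the next three subsections.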

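3.1. The sum $S_2$. Recall that

$$\begin{aligned}
S_2 &= \sum_{l \le u} \mu(l) \sum_{N/l < k \le N_2/l} \log k \; e(h\mathrm{Li}(kl)) \\
&\ll \sum_{l \le u} \left( \sqrt{Nh} + \int_{N/l}^{N_2/l} \frac{1}{x} \sqrt{xhl}\,dx \right) \\
&\ll \sqrt{Nh}\,u
\end{aligned}$$

by partial summation and Lemma A.1. Thus $\sum_{1 \le h \le H} |S_2| \ll N^{49/50+\epsilon}$
(as desired) on taking $u = N^{12/25}$ and recalling that $H$ is a power of $\log N$.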

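3.2. The sum $S_1$. Rewrite $S_1$ and split it into $\ll \log^2 N$ sums of the form:

$$S_1 = \sum_{\substack{N < kl \le N_2 \\ v < k,\; u < l}} \alpha(k)\beta(l) e(h\mathrm{Li}(kl))
\ll \log^2 N \left| \sum_{L < l \le 2L} \; \sum_{\substack{K < k \le 2K \\ N < kl \le N_2}} \alpha(k)\beta(l) e(h\mathrm{Li}(kl)) \right|.$$

The roles of $k$ and $l$ are essentially symmetric (allowing $\alpha$ and $\beta$ to be
either $\Lambda$ or $a$ will not affect the final estimate) and taking $v = u$, we may
arrange it so $N^{12/25} \le K \le N^{1/2}$ and $N^{1/2} \le L \le N^{13/25}$.

Now using Lemma B.2 we find that: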

Si < log^ N (^KL^^/^^h^ log^ L log^ 

« log6iv(7V49/50;^l/26^_ 

Thus $\sum_h |S_1| \ll N^{49/50+\epsilon}$, as desired.

3.3. The sum $S_3$. Recall $S_3$ and break it according to:

$$S_3 = \sum_{r \le uv} b(r) \sum_{N/r < k \le N_2/r} e(h\mathrm{Li}(kr)) = \sum_{r \le u} + \sum_{u < r \le uv} = S_4 + S_5.$$

We treat $S_4$ exactly as $S_2$, getting $S_4 \ll \sqrt{Nh}\,u \log u$, which is clearly
sufficiently small.

For $S_5$, the analysis is identical to that of $S_1$ and gives the same estimate, so we
are done.

4. Numerical Investigations

Despite our colloquial use of "on the nose", we should point out that
$p = \lfloor \mathrm{iL}(n) \rfloor$ need not be the $n$-th prime; indeed most of the time
this cannot be the case. Consider the following table, containing the primes on the nose
less than 1500, where we show values of $k$ and $n$ such that the $k$-th prime,
$p_k = \lfloor \mathrm{iL}(n) \rfloor$. (A caveat for those looking to reproduce the table
below: Mathematica's LogIntegral function is actually $\mathrm{li}(x) = \int_0^x dt/\log t$;
one must subtract $\mathrm{li}(2)$ before using something like FindRoot to compute iL.)
Notice that since $\pi(x) < \mathrm{Li}(x)$ for the first many values of $x$, where
$\pi(x)$ here is the usual prime counting function (in fact for all values of $x$ less
than the first Skewes number, known to be at most $1.39 \times 10^{316}$), it is
impossible for $p_n = \lfloor \mathrm{iL}(n) \rfloor$ in this range (not counting
$k = 3, 4$). We have also included the values of $\mathrm{Li}(p)$ and $1/\log p$ in the
table to point out that a prime on the nose occurs only when the value of $\mathrm{Li}(p)$
is just less than an integer, by no more than
$1/\log p \approx \mathrm{Li}(p + 1) - \mathrm{Li}(p)$.
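
For readers without Mathematica, the search behind the table can be sketched in a few lines. This is an illustration under stated assumptions rather than the computation actually used for the table: Li is evaluated through the series for li (so that the subtraction of $\mathrm{li}(2)$ is built in), iL is obtained by bisection, and a prime $p$ is declared on the nose when the interval $[\mathrm{Li}(p), \mathrm{Li}(p+1))$ contains an integer $n$. The function names are hypothetical.

```python
import math

def Li(x, terms=200):
    """Li(x) = li(x) - li(2), using li(y) = gamma + log log y + sum (log y)^k/(k*k!)."""
    def li_without_gamma(y):
        t = math.log(y)
        total = math.log(t)
        term = 1.0                      # holds t^k / k!
        for k in range(1, terms + 1):
            term *= t / k
            total += term / k
        return total
    return li_without_gamma(x) - li_without_gamma(2.0)

def iL(n, iterations=100):
    """Inverse of Li by bisection, for n >= 0."""
    lo, hi = 2.0, 4.0
    while Li(hi) < n:
        hi *= 2.0
    for _ in range(iterations):
        mid = 0.5 * (lo + hi)
        if Li(mid) < n:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)

def primes_below(limit):
    """All primes < limit (limit >= 3), by the sieve of Eratosthenes."""
    sieve = bytearray([1]) * limit
    sieve[0:2] = b"\x00\x00"
    for i in range(2, math.isqrt(limit - 1) + 1):
        if sieve[i]:
            sieve[i * i :: i] = bytearray(len(range(i * i, limit, i)))
    return [i for i in range(limit) if sieve[i]]

def primes_on_the_nose(limit):
    """Triples (k, n, p): p is the k-th prime, p < limit, and Li(p) <= n < Li(p + 1)."""
    rows = []
    for k, p in enumerate(primes_below(limit), start=1):
        n = math.ceil(Li(p))            # smallest integer n with Li(p) <= n
        if n < Li(p + 1):
            rows.append((k, n, p))
    return rows

if __name__ == "__main__":
    for k, n, p in primes_on_the_nose(1500):
        print(k, n, p, round(iL(n), 3), round(Li(p), 3),
              round(Li(p + 1), 3), round(1 / math.log(p), 3))
```

With this criterion the sketch also reports the boundary case $p = 2$ (because $\mathrm{Li}(2) = 0$ is itself an integer), which does not appear in the table.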



5. Conclusion 

As a result of the current investigation, the author became interested in the
following question. Can one obtain a lower bound for the, say, Lebesgue measure
of the set of $t < x$ for which $|\pi(t) - \mathrm{Li}(t)| < c\sqrt{t}$? An affirmative
answer would say that the Riemann Hypothesis is true for a positive proportion of
values. This question should be somewhat analogous to Selberg's theorem that a positive
proportion of the zeros of the Riemann Zeta Function lie on the critical line, though
the author is not aware of a way to relate the two statements. It may provide
an alternative to Density Theorems in applications. Another direction of further
research is to attempt a theorem along the lines of Green-Tao to find three-term
(or longer) arithmetic progressions of primes on the nose. Lastly, we also hope to
follow Goldston-Yildirim and show that there are small gaps between primes on
the nose and primes. In particular, we conjecture that there are infinitely many
primes on the nose, $p = \lfloor \mathrm{iL}(n) \rfloor$, such that $p + 2$ is prime.
(Of course, the smallest gap between two primes on the nose is $\log n$, since this is
the gap in the function $\mathrm{iL}(n)$.) We hope to pursue these matters in the future.

Table of the primes on the nose less than 1500:

  k     n     p_k    iL(n)       Li(p_k)    Li(p_k + 1)   1/log p_k
  3     3     5      5.687       2.589      3.177         0.621
  4     4     7      7.572       3.711      4.208         0.513
  7     8     17     17.48       7.831      8.18          0.352
  9     10    23     23.508      9.838      10.155        0.318
  12    14    37     37.099      13.972     14.248        0.276
  30    32    113    113.632     31.866     32.077        0.211
  33    37    137    137.791     36.839     37.042        0.203
  37    41    157    157.769     40.847     41.045        0.197
  39    43    167    167.955     42.813     43.008        0.195
  40    44    173    173.094     43.981     44.175        0.194
  44    48    193    193.94      47.821     48.011        0.19
  46    49    199    199.221     48.958     49.147        0.188
  60    64    281    281.348     63.938     64.115        0.177
  74    80    373    373.946     79.84      80.008        0.168
  75    81    379    379.878     80.852     81.02         0.168
  78    84    397    397.767     83.871     84.038        0.167
  80    86    409    409.769     85.872     86.038        0.166
  82    88    421    421.829     87.862     88.028        0.165
  84    90    433    433.947     89.843     90.008        0.164
  106   113   577    577.032     112.994    113.152       0.157
  116   123   641    641.145     122.977    123.132       0.154
  118   124   647    647.614     123.905    124.059       0.154
  122   128   673    673.586     127.909    128.063       0.153
  128   135   719    719.407     134.938    135.09        0.152
  131   138   739    739.183     137.972    138.123       0.151
  146   153   839    839.235     152.964    153.113       0.148
  149   156   859    859.469     155.93     156.078       0.148
  160   168   941    941.096     167.985    168.131       0.146
  161   169   947    947.946     168.861    169.007       0.145
  169   178   1009   1009.923    177.86…    178.011       0.144
  177   184   1051   1051.55     183.92     184.064       0.143
  183   190   1093   1093.416    189.94     190.083       0.142
  192   200   1163   1163.701    199.9      200.042       0.141
  198   207   1213   1213.264    206.962    207.103       0.14
  206   216   1277   1277.407    215.943    216.082       0.139
  210   218   1291   1291.723    217.898    218.038       0.139
  217   223   1327   1327.61     222.915    223.054       0.139
  222   233   1399   1399.789    232.891    233.029       0.138
  236   245   1487   1487.084    244.988    245.125       0.136


Appendix A. Stationary Phase 
We require the following estimate. 
Lemma A.1. For any integer $k \ge 1$,

$$\sum_{N < n \le N_1 \le 2N} e(h\mathrm{Li}(nk)) \ll
\begin{cases}
N & \text{if } h = 0, \\
\dfrac{(N|h|k)^{1/2}}{\log(Nk)} & \text{otherwise.}
\end{cases}$$

Proof. The trivial estimate is $\ll N$. Assume without loss of generality $h > 0$. Let
$\psi$ be a smooth cutoff function supported in $[1, 2)$. We apply Poisson summation
and estimate by the method of stationary phase:

$$\sum_n \psi\left(\frac{n}{N}\right) e(h\mathrm{Li}(nk)) = \sum_m b_m,$$

where the Fourier coefficient

$$b_m = \int_{\mathbb{R}} \psi\left(\frac{x}{N}\right) e(h\mathrm{Li}(xk)) e(-xm)\,dx = N \int \psi(x) e(h\mathrm{Li}(Nxk) - Nxm)\,dx.$$

In general, in an integral of the form $I = \int \psi(x) e(\phi(x))\,dx$ where $\psi$ is
essentially constant, the contribution to $I$ is negligible except at the critical
points of the phase, $\phi$. At such a point, $x_s$, say, the contribution has size
$\psi(x_s) |\phi''(x_s)|^{-1/2}$ and phase $e(\phi(x_s))$. The unique stationary point in
our situation is $x_s = \frac{1}{Nk} \exp\left(\frac{hk}{m}\right)$, and the contribution
to the integral is:

$$b_m \ll N \psi(x_s) \frac{\log(N x_s k)}{\sqrt{hNk}} = \psi(x_s) \frac{\sqrt{Nhk}}{m},$$

where the range of $m$ is determined by the support of $\psi$: $1 \le x_s \le 2$, that
is, $\frac{hk}{\log(2Nk)} \le m \le \frac{hk}{\log(Nk)}$. Thus the total contribution is

$$\sum_n \psi\left(\frac{n}{N}\right) e(h\mathrm{Li}(nk)) \ll (Nhk)^{1/2} \sum_{\frac{hk}{\log(2Nk)} \le m \le \frac{hk}{\log(Nk)}} \frac{1}{m} \ll \frac{(Nhk)^{1/2}}{\log(Nk)}.$$

□

Appendix B. Estimating Type II Sums 
We first require the following estimate. 

Lemma B.1. For fixed integers $K < k \le 2K$ and $1 \le q \le Q$ (where $K$ and $Q$ are
some parameters) define the following expression:

$$S_0(q; k) = \sum_{L < l \le 2L} e\left( h\left( \mathrm{Li}(lk) - \mathrm{Li}(l(k + q)) \right) \right). \qquad (B.1)$$

Then 



Proof. Recall Corollary 8.5: Suppose that for $j = 2, 3$, some $C > 1$ and $F > 0$,
we have a function $f$ with well-behaved $j$-th derivatives - say $f$ satisfies

§<^|/(^'(^)l<^ (B.2) 

for all $x \in [L, 2L]$. Then

J2 e(/(0)«CF5Lhog3L, 

L<l<2L 

where the implied constant is absolute. 
To apply this to our situation, consider $g_a(x) = \mathrm{Li}(xa)$. Then:

$$g_a'(x) = \frac{a}{\log(xa)}, \qquad g_a''(x) = -\frac{a}{x \log^2(xa)}, \qquad g_a'''(x) = \frac{a(\log(xa) + 2)}{x^2 \log^3(xa)}.$$

Let us take $f(x) = h(g_k(x) - g_{k+q}(x))$ so that $S_0 = \sum_l e(f(l))$. By expanding
with respect to $k$, we get:

$$f''(x) = h \left( -\frac{k}{x \log^2(xk)} + \frac{k + q}{x \log^2(x(k + q))} \right)
= hq \left( \frac{\log(xk) - 2}{x \log^3(xk)} + \frac{q_1}{2} \cdot \frac{6 - 2\log(xk)}{kx \log^4(kx)} \right)$$

and

$$f'''(x) = h \left( \frac{k(\log(xk) + 2)}{x^2 \log^3(xk)} - \frac{(k + q)(\log(x(k + q)) + 2)}{x^2 \log^3(x(k + q))} \right)
= -hq \left( \frac{\log^2(xk) - 6}{x^2 \log^4(xk)} + \frac{q_2}{2} \cdot \frac{24 - 2\log^2(kx)}{kx^2 \log^5(kx)} \right)$$

for some $q_1, q_2 \in [0, q]$. Therefore

^'if'V M <r ''^ f 2L{log{2Lk)-2) 
y'^^"^' - -[ logH2Lk) 
^\f"{x)\ > h(L{\og{Lk)-2] 



2 " ^ ^' - 2 V log^(LA:) 



and 



T'^ - ^V"- log\2Lk) 

^| > / log^(£fc)-6 
T'^ - 6 log^(ifc) 

We can therefore apply Corollary 8.5 taking F = i^J^^^^) and C = 12Q jj^^ . 
The result is the estimate: 

as desired. 

□ 



We have used quite crude bounds here; certainly a tighter analysis will yield
a better exponent. Nevertheless, the estimate in the previous Lemma suffices to
control Type II sums as follows.

Lemma B.2. Let $\alpha(l)$ and $\beta(k)$ be sequences of complex numbers supported in
$(L, 2L]$ and $(K, 2K]$, respectively, and suppose that
$\sum_l |\alpha(l)|^2 \ll L \log^3 L$ and $\sum_k |\beta(k)|^2 \ll K \log^3 K$. Then

J2 a{l)l3{k)e{hU{lk)) <C KL^'^/^^h^ log^/^ ^ ^^^3/2 ^_ 

L<l<2L K<k<2K 

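Proof. Let $S$ denote the sum on the left hand side. By Cauchy-Schwarz,

$$|S|^2 \le \sum_l |\alpha(l)|^2 \; \sum_l \left| \sum_k \beta(k) e(h\mathrm{Li}(lk)) \right|^2.$$

Let $Q \le K$ be a parameter to be chosen later. Using the Weyl-van der Corput
inequality (e.g. [2, Lemma 2.5]):

$$\left| \sum_{K < k \le 2K} z_k \right|^2 \le \frac{K + Q}{Q} \sum_{|q| < Q} \left( 1 - \frac{|q|}{Q} \right) \sum_{K < k,\, k + q \le 2K} z_k \overline{z_{k+q}}$$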

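and the supposed estimates on the second moments of $\alpha$ and $\beta$, we get:

$$\begin{aligned}
|S|^2 &\ll L \log^3 L \; \frac{K}{Q} \sum_{|q| < Q} \left( 1 - \frac{|q|}{Q} \right) \sum_l \sum_{K < k,\, k + q \le 2K} \beta(k) \overline{\beta(k + q)} \, e\left( h\left( \mathrm{Li}(lk) - \mathrm{Li}(l(k + q)) \right) \right) \\
&\ll \frac{L^2 K^2}{Q} \log^3 L \log^3 K + L \log^3 L \; \frac{K}{Q} \sum_{1 \le |q| < Q} \sum_k |\beta(k)\beta(k + q)| \, |S_0(q; k)|,
\end{aligned}$$

where $S_0$ is defined by (B.1).

Using Cauchy's inequality (that $|xy| \le \frac12 (|x|^2 + |y|^2)$) and the fact that
$|S_0(q; k)| = |S_0(-q; k + q)|$, we get

$$|S|^2 \ll \frac{L^2 K^2}{Q} \log^3 L \log^3 K + \frac{LK}{Q} \log^3 L \sum_k |\beta(k)|^2 \sum_{1 \le q < Q} |S_0(q; k)|.$$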



From Lemma B.1 we have the estimate:

i E \So{q;k)\<s:Q^hHi 



l<q<Q 



so we finally see that



\S\^ « log^ L log^ K + LiK^ log^ L log^ KQ^h^ 

The choice Q — [L^/^^/i^^/^^J gives the desired result. 



□ 



Appendix C. Primes on the Nose in Arithmetic Progressions 
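In this section, we show that

$$\pi(x; q, a) = \sum_{\substack{p \le x,\; p \equiv a \ (\mathrm{mod}\ q) \\ p = \lfloor \mathrm{iL}(n) \rfloor}} 1 = \frac{1}{\phi(q)} \frac{x}{\log^2 x} + O\left( \frac{x}{\log^3 x} \right)$$

for $(a, q) = 1$, as $x \to \infty$. Using

$$\frac{1}{\phi(q)} \sum_{\chi} \overline{\chi}(a) \chi(n) = \begin{cases} 1 & \text{if } n \equiv a \ (\mathrm{mod}\ q), \\ 0 & \text{otherwise,} \end{cases}$$

where the sum is over all characters $\chi$ to the modulus $q$, we see that our goal is to
demonstrate

$$\psi(x; \chi) = \delta_\chi \frac{x}{\log x} + O\left( \frac{x}{\log^2 x} \right),$$

where $\delta_\chi$ is 1 for the trivial character and 0 otherwise, and where

$$\begin{aligned}
\psi(x; \chi) &= \sum_{n \le x} \Lambda(n)\chi(n) \left( \lfloor \mathrm{Li}(n + 1) \rfloor - \lfloor \mathrm{Li}(n) \rfloor \right) \\
&= \sum_n \Lambda(n)\chi(n) \left( \mathrm{Li}(n + 1) - \mathrm{Li}(n) \right) - \sum_n \Lambda(n)\chi(n) \left( \psi(\mathrm{Li}(n + 1)) - \psi(\mathrm{Li}(n)) \right).
\end{aligned}$$

Here again $\psi(x) = \{x\} - \frac12$ is the shifted fractional part. Since
$\chi(kl) = \chi(k)\chi(l)$ and $|\chi(n)| \le 1$, the second sum is
$O\left( \frac{x}{\log^2 x} \right)$ by the same analysis as in the original
problem. So we have

$$\psi(x; \chi) = \sum_{n \le x} \Lambda(n)\chi(n) \frac{1}{\log n} + O\left( \frac{x}{\log^2 x} \right).$$

When $\chi = \chi_0$ is the trivial character, we get
$\psi(x; \chi_0) = \frac{x}{\log x} + O\left( \frac{x}{\log^2 x} \right)$ and this
is the main contribution. There is no contribution to the main term from nontrivial
characters, so we are done.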